On the Normalization and Visualization of Author Co-Citation Data Salton's Cosine versus the Jaccard Index

نویسنده

  • Loet Leydesdorff
چکیده

The debate about which similarity measure one should use for the normalization in the case of Author Co-citation Analysis (ACA) is further complicated when one distinguishes between the symmetrical co-citation—or, more generally, co-occurrence— matrix and the underlying asymmetrical citation—occurrence—matrix. In the Web environment, the approach of retrieving original citation data is often not feasible. In that case, one should use the Jaccard index, but preferentially after adding the number of total citations (occurrences) on the main diagonal. Unlike Salton’s cosine and the Pearson correlation, the Jaccard index abstracts from the shape of the distributions and focuses only on the intersection and the sum of the two sets. Since the correlations in the cooccurrence matrix may partially be spurious, this property of the Jaccard index can be considered as an advantage in this case.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RESYGEN: A Recommendation System Generator using domain-based heuristics

The relation between Pearson's correlation coefficient and Salton's cosine measure is revealed based on the different possible values of the division of the-norm and the-norm of a vector. These different values yield a sheaf of increasingly straight lines which form together a cloud of points, being the investigated relation. The theoretical results are tested against the author co-citation rel...

متن کامل

Citation Review and Scientific Visualization of Articles Published in the Iranian Rehabilitation Journal (IRJ) 2003-2023 in the Scopus Database

Objective: Accurate scientific planning and societal macro policies require reviewing and evaluating research output. Scientometrics offers a valuable approach for assessing the activity of journals that publish a majority of scientific productions. This study aims to analyze the scientific activity of the Iranian Rehabilitation Journal (IRJ) by examining its publication history in the Scopus d...

متن کامل

The Normalization of Occurrence and Co-occurrence Matrices in Bibliometrics using Cosine Similarities and Ochiai Coefficients

We prove that Ochiai similarity of the co-occurrence matrix is equal to cosine similarity in the underlying occurrence matrix. Neither the cosine nor the Pearson correlation should be used for the normalization of co-occurrence matrices because the similarity is then normalized twice, and therefore over-estimated; the Ochiai coefficient can be used instead. Results are shown using a small matri...

متن کامل

Social Network Analysis Using Author Co-Citation Data

This study examines the social network of scholars in the field of Communication by using author co-citation data. A matrix containing the number of co-cited documents between pairs of authors is created for social network analysis of scholars who are on the editorial board of Journal of Communication, and the networked map of the scholars is used to visualize the knowledge structure of the fie...

متن کامل

Co-occurrence Matrices and their Applications in Information Science: Extending ACA to the Web Environment

Co-occurrence matrices, such as co-citation, co-word, and co-link matrices, have been used widely in the information sciences. However, confusion and controversy have hindered the proper statistical analysis of this data. The underlying problem, in our opinion, involved understanding the nature of various types of matrices. This paper discusses the difference between a symmetrical co-citation m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JASIST

دوره 59  شماره 

صفحات  -

تاریخ انتشار 2008